A Scalable Communication-Induced Checkpointing Algorithm for Distributed Systems

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Checkpointing Algorithm for Distributed Systems Implementing Reliable Communication Channels

This paper presents a new checkpointing algorithm for systems using reliable communication channels. The new algorithm requires O(n + m) communication messages, where n is the number of participating processes, and m is the number of “ late” messages. The algorithm is non-blocking, requires minimal message logging, and has minimal stable storage requirements. This algorithm is also scalable, si...

متن کامل

A Non-blocking Checkpointing Algorithm for Distributed Systems

The technology of checkpointing and rollback recovery as an effective method of fault tolerance, has been used widely on the parallel or distributed computer systems. We have presented a nonblocking coordinated checkpointing algorithm for distributed systems, which are differ from the conventional approach of taking first temporary checkpoints and then converting them to permanent ones by proce...

متن کامل

An Efficient Checkpointing Algorithm for Distributed Systems Implementing Reliable Communication Channels

This paper presents a new checkpointing algorithm that guarantees the semantics of reliable communication channels despite the crash and recovery of processes. This algorithm requires O(n + m) communication messages, where n is the number of participating processes, and m is the number of “ late” messages. The algorithm is nonblocking, requires minimal message logging, and has minimal stable st...

متن کامل

An Index-Based Checkpointing Algorithm for Autonomous Distributed Systems

This paper presents an index based checkpointing algorithm for distributed systems with the aim of reducing the total number of checkpoints while ensuring that each checkpoint belongs to at least one consistent global checkpoint or recovery line The algorithm is based on an equivalence relation de ned between pairs of successive checkpoints of a process which allows in some cases to advance the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEICE Transactions on Information and Systems

سال: 2013

ISSN: 0916-8532,1745-1361

DOI: 10.1587/transinf.e96.d.886